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! . A method for representing the structure of a polyketide produced by a modular 
polyketide synthase, said method comprising the steps of: 

(a) defining a set of monomer units of which said polyketide is 

composed, 

(b) assigning an alphanumeric symbol or symbols to each different 

monomer unit in said set, 

(c) identifying one or more monomers in said set that is present m said 

polyketide, and 

(d) composing a stnng of said symbols ordered in a manner reflectmg 
the order in which sa.d monomers occurs m said polyketide, wherein said stnng 
of symbols represents the structure of said polyketide. 

2 The method of claim 1, wherein said monomer set comprises two-carbon unit 
monomers, wherein a first carbon of satd un„ ,s substituted with hydrogen or methyl, and 
a second carbon of said unit is substituted with oxygen, hydroxy, or hydrogen, and satd 
two carbon unit compnses either a s.ngle or a double bond between said first and second 
carbons. 

3 The method of claim 2, wherein said monomer set addttionally compnses one or 
more members selected from the group consistmg of two carbon u„,t monomers m wh.ch 
sa,d firs, carbon U substituted with hydroxy, methoxy, or ethyl; a motety correspond 
,o an ammo ac.d or am.no acd derivative incorporated tnto a PKS by a non-nbosomal 
peptide synthase; a motety correspondmg to a structure incorporated tnto a po.yket.de by 
an AMP l.gase or a CoA ligase; and a mo.ety corresponding to a stmcture correspondmg 
,„ a structure in a polyketide after mod.ficat.on by a polyket.de modificat.on enzyme. 



41 



4. The method of claim 2 wherein .he set of monomer unit and corresponding 
symbol comprises: 




: A; 



= B; 



= C; 



.=D; 



= E; 



. = F: 




J; 



K; 



L; 



= M ; and 



5. The method of claim 4 wherein the set of monomer unit further comprises a 
miscellaneous monomer that is assigned the symbol Q. 
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6 . The method of clatm 4 wherein the se, of monomer unit and correspond^ 

symbol further comprises 

OH OH Q H 

r ft R 

^=D'; J^/=G'; JL^-H'; 

R R A 

/-y = J'; = K' ; and /"Y" = M' 

R ft R 

7 A database of polyketides, in whteh eaeh satd member is represented by a stnng 
of alpha-numenc symbols, wherem said symbols represent structnra. subuntts of satd 
polykettde, and said stnng represents the order in whieh sueh subnntts oecur » satd 
polyketide. 

8 . The database of claim 7 that includes at least 100 different polyketides. 

9 . The database of claim 7 wherein each said member is represented by a 
CHUCKLES string. 

10 . The database of claim 7 wheretn each said member ts represented by an annotated 

CHUCKLES string. 
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11. The database of claim 7 wherein 
subunit are selected from the group consisting of 



the symbol and its corresponding structural 



A = 



; B = 



; c = 



; D = 



E = 



; F = 



; G = 



; h = 



; L 



and Q for a miscellaneous monomer. 
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n. The database of claim 7 wherein the symbol and its corresponding structural 

subunit are selected from the group consisting of 

oh 9H oh ^ 



B = 



; f = 



; G = 



; h = 



; L = 



M = 



N = 



; A' = 



; B' 



•; M' 



c = /Y ; ^' = 

R 

R R 

and Q for a miscellaneous monomer. 



13. A database of polyketides, m which each said member is represented by a 
linearized representation of said polyketide. 
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,4. A method of des.gning a PKS gene capable of producmg a desired polyketide, 

which method comprises: 

(a) defining a string of alphanumeric symbols representing the structure of 

said polyketide, 

(b ) companng said stnng to a database of stnngs of alphanumeric symbols 
representing polyketides produced by PKS genes, 

(c) identifying common elements in said string representing the structure of 
said polyketide with elements in said strings in said database, and 

(d) generating one or more new stnngs from elements identified in step (b) 
that match said stnng representing the structure of said polyketide, wherein said new 
stnng defines a PKS gene capable of producing said polyketide. 

15. The method of claim 14, wherein all possible PKS genes encoding a desked 
polyketide from said database are generated and displayed. 

16. The method of claim 14, wherein said new stnngs generated in step (d) are rated 
and displayed in an order based on one or more parameters. 
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, , The method of claim 16, wherein said parameters are selected from the group 
consisting of number of non-native module interfaces and number of non-native protein 
interfaces. 
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